On-Line Data Archives

نویسندگان

  • Kenneth A. Hawick
  • Paul D. Coddington
  • Heath A. James
  • Craig J. Patten
چکیده

Digital libraries and other large archives of electronically retrievable and manipulable material are becoming widespread in both commercial and scientific arenas. Advances in networking technologies have led to a greater proliferation of wide-area distributed data warehousing. This presents particular challenges associated with distributed data management. We review the available tools and technologies for supporting distributed on-line data archives and explain the key concept of “active” data archives, in which data can be processed on-demand prior to delivery. We present a summary of our program of work in developing wide-area data warehousing software infrastructure. Our system primarily targets geographically distributed archives of large scientific data sets, such as satellite image data, that are stored hierarchically on disk arrays and tape silos and accessed by a variety of scientific and decision support applications. We discuss the issues faced in building such an infrastructure, and the key areas that are the subject of current research, such as efficient bulk data storage, processing and delivery. Interoperability is a major issue for distributed data archives, and requires standards for server interfaces and metadata. There is currently considerable activity in developing such standards for different application areas. We provide an overview of some of this work, and of our experiences in implementing an active data archive of satellite images based on evolving interface standards for accessing and processing geospatial image data.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speechfind: an experimental on-line spoken document retrieval system for historical audio archives

In this study, we present the SpeechFind system, an experimental on-line spoken document retrieval system for historical audio archives. As part of an on-going U.S. NSF Digital Library Initiative project, entitled the National Gallery of the Spoken Word (NGSW), SpeechFind is intended to serve as an audio index and search engine for spoken word collections spanning the 20th century with as much ...

متن کامل

Analysis of the Request Patterns to the NSSDC On - line Archive

The successful implementation of mass storage archives require careful attention to performance optimizations, to ensure that the system can handle the ooered load. However, performance optimizations require an understanding of user access patterns. Since on-line archives and digital libraries are so new, little information is available. has run an on-line mass storage archive of space data, th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001